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1 Introduction 



Methods of symbolic dynamics are rather useful in the study of combinatorial properties 
of words, investigation of problems of number theory and theory of dynamical systems. 
Let M be a compact metric space, U C M be its open subspace, / : M — > M be a 
homeomorphism of the compact into itself, and x G M be an initial point. With the 
sequence of iterations, one can associate an infinite binary word 



which is called the evolution of point Xq. Symbolic dynamics investigates the interrelation 
between the properties of the dynamical system (M, /) and the combinatorial proper- 
ties of the word W n . For words over alphabets which comprise more symbols, several 
characteristic sets should be considered: U\, . . . , U n . 

By the direct problem of symbolic dynamics we mean the study of combinatorial prop- 
erties of the words generated by a given dynamical system; the inverse problem of symbolic 
dynamics refers to the investigation of the properties of the dynamical system, i.e., the 
properties of the compact set M and transform /, by the combinatorial properties of the 
word W. 

Inverse problems of symbolic dynamics related to the unipotent transformation of a 
torus were studied in paper [2]. 

Problems (both direct and inverse) related to the rotation of a circle bring about a 
class of words which are called Sturmian words. Sturmian words are infinite words over a 
binary alphabet which contain exactly n + 1 different subwords (factors) of length n for 
any n > 1. The following classical result is widely known. 

Theorem 1.1 (Equivalence theorem (|21j,[20j).) Let W be an infinite recurrent word 
over the binary alphabet A = {a, b}. The following conditions are equivalent: 

1. The word W is a Sturmian word, i.e., for any n > 1, the number of different 
subwords of length n that occur in W is equal to T n (W) = n + 1. 

2. The word is not periodic and is balanced, i.e., any two subwords u,v C W of the 
same length satisfy the inequality \\v\ a — \u\ a \ < 1, where \w\ a denotes the number 
of occurrences of symbol a in the word w. 

3. The word W = (w n ) is a mechanical word with irrational a, which means that there 
exist an irrational a, xq G [0,1], and interval U C S 1 , \U\ = a, such that the 
following condition holds: 



There are several different ways of generalizing Sturmian words. 

First, one can consider balanced words over an arbitrary alphabet. Balanced nonpe- 
riodic words over an n-letter alphabet were studied in paper [T7] and later in [IB]. In 
papers [3] and [5], a dynamical system that generates an arbitrary nonperiodic balanced 
word was constructed. 




a, /(»>(z ) G U 

b, fW(x )#U 




a, T a n (x Q )eU 

b, T a n (x )<£U 
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Second, generalization may be formulated in terms of the complexity function. Com- 
plexity function Tw{n) presents the number of different subwords of length n in the word 
W. Sturmian words satisfy the relation Ty/(n + 1) — Ty/(n) = 1 for any n > 1. Natural 
generalizations of Sturmian words are words with minimal growth, i.e., words over a finite 
alphabet that satisfy the relation Tw(n + 1) — Twin) = 1 for any n > k, where A; is a 
positive integer. Such words were described in terms of rotation of a circle in paper [6]. 
Note also that words whose growth function satisfies the relation lim^oo T(n)/n = 1 
were studied in paper [TJ. 

Words with complexity function T w {n) = 2n + 1 were studied by P. Arnoux and 
G. Rauzy ([HI [221 [23]), words with growth function T w {n) = 2n + 1 were investigated by 
G. Rote |24j. Consideration of the general case of words with linear complexity function 
involves the study of words generated by interval exchange transformations. The problem 
of describtion of such words was posed by Rauzy [23] . Words with linear growth of the 
number of subwords were also studied by the school of V. Berthe, S. Ferenczi, and Luca 
Q. Zamboni ([H], $\). They also investigate combinatorial sequences reated with interval 
exchange transformations. Paper [16] contains description of words generated by three- 
interval exchange transformations; paper [H] contains description of words generated by 
symmetric interval exchange transformations (such transformations are closely related to 
multi-dimensional continued fractions, and this relation looks extremely intersting). More 
precisely, they describe a combinatorial algorithm for generating the symbolic sequences 
which code the orbits of points under an interval exchange transformation on k intervals, 
using the symmetric permutation i — > k — i + 1 ([14]). 

More general result was obtained in the work [15]. In this paper give a complete 
characterization of those sequences of subword complexity (k — l)n + 1 which are nat- 
ural codings of orbits of ^-interval exchange transformations, (or, equivaently, interval 
exchange transformation, satisfying i.d.o.c. condition). 0. 

Interval exchange transformations T satisfies the infinite distinct orbit condition (or 
i.d.o.c. for short) if the k — 1 negative trajectories {T~ n (xj)} n >o ,(1 < i < k), of the 
discontinuities of T are infinite disjoint sets. The main result of paper [15] (wich is 
independent of our result) is following: 

Theorem 1.2 A minimal sequence W is the natural coding of a k-interval exchange 
transformation, defined by permutations (tt , tci) such that 7To _1 ({l, . . . , j}) ^ 7Ti _1 ({l, . . . , j}) 
for every 1 < j < k — 1, and satisfying the i.d.o.c. condition, if and only if the words of 
length one occurring in W are F\ = {1, . . . , k} and it satisfies the following conditions: 

1. If w is any word occurring in W, A{w), (resp. D{w)), the set of all letters x such 
thatxw, (resp. wx), occurs in W , is an interval for the order of H\, resp. tt q , 

2. If x G A(w), y G A(w), x < y for the order of "R\, z G D(xw), t G D(yw), then 
z <t for the order of 7To, 

3. If x £ A(w) and y G A(w) are consecutive in the order of 7r 1; D(xw) H D(yw) is a 
singleton. 

In this paper we study words generated by general piecewise-continuous transformation 
of the interval. Further we prove equivalence set words generated by piecewise-continuous 

lr This result wich is close to ours, was noticed to us by L. Zamboni, in order to mention that and make 
some other correstions we replaced our paper to new version 
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transformation and words generated by interval exchange transformation. This method 
get capability of descriptions of the words generated by arbitrary interval exchange trans- 
formation. 

This work is targeted to the following 

Inverse problem: Which conditions should be imposed on a uniformly recurrent word 
W in order that it be generated by a dynamical system of the form (7 ,T,U\, . . . , Uk), where 
I is the unit interval and T is the interval exchange transformation? 

The answer to this question is given in terms of the evolution of the labeled Rauzy 
graphs of the word W. The Rauzy graph of order k (the /c-graph) of the word W is the 
directed graph whose vertices biuniquely correspond to the factors of length k of the word 
W and there exists an arc from vertex A to vertex B if and only if W has a factor of 
length k + 1 such that its first k letters make the subword that corresponds to A and the 
last k symbols make the subword that corresponds to B. By the follower of the directed 
/c-graph G we call the directed graph Fol(G) constructed as follows: the vertices of graph 
Fol(G) are in one-to-one correspondence with the arcs of graph G and there exists an arc 
from vertex A to vertex B if and only if the head of the arc A in the graph G is at the 
notch end of B. The (k + l)-graph is a subgraph of the follower of the /c-graph; it results 
from the latter by removing some arcs. Vertices which are tails of (or heads of) at least 
two arcs correspond to special factors (see Section 2); vertices which are heads and tails 
of more than one arc correspond to bispecial factors. The sequence of the Rauzy /c-graphs 
constitutes the evolution of the Rauzy graphs of the word W. The Rauzy graph is said 
to be labeled if its arcs are assigned letters / and r and some of its vertices (perhaps, none 
of them) are assigned symbol "-" . 

The follower of the labeled Rauzy graph is the directed graph which is the follower 
of the latter (considered a Rauzy graph with the labeling neglected) and whose arcs are 
labeled according to the following rule: 

1. Arcs that enter a branching vertex should be labeled by the same symbols as the 
arcs that enter any left successor of this vertex; 

2. Arcs that go out of a branching vertex should be labeled by the same symbols as 
the arcs that go out of any right successor of this vertex; 

3. If a vertex is labeled by symbol then all its right successors should also be 
labeled by symbol "-" . 

In terms of Rauzy labeled graphs we define the asymptotically correct evolution of 
Rauzy graphs, i.e., we introduce rules of passing from /c-graphs to (k + l)-graphs. Namely, 
the evolution is said to be correct if, for all k > 1, the following conditions hold when 
passing from the /c-graph Gk to the {k + l)-graph Gk+i '■ 

1. The degree of any vertex is at most 2, i.e., it is incident to at most two incoming 
and outgoing arcs; 

2. If the graph contains no vertices corresponding to bispecial factors, then G n+ i co- 
incides with the follower D(G n ); 

3. If the vertex that corresponds to a bispecial factor is not labeled by symbol 
then the arcs that correspond to forbidden words are chosen among the pairs Ir and 
rJ; 
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4. If the vertex is labeled by symbol "-" , then the arcs to be deleted should be chosen 
among the pairs 11 or rr. 

The evolution is said to be asymptotically correct if this condition is valid for all k 
beginning with a certain k = K. The oriented evolution of the graphs means that there 
are no vertices labeled by symbol The main result of this work consists in the 

description of infinite words generated by interval exchange transformations (and answers 
a Rouzy question [23]): 

Main theorem. A uniformly recurrent word W 

1. is generated by an interval exchange transformation if and only if the word is pro- 
vided with the asymptotically correct evolution of the labeled Rauzy graphs; 

2. is generated by an orientation-preserving interval exchange transformation if and 
only if the word is provided with the asymptotically correct oriented evolution of the 
labeled Rauzy graphs. 

We have no restriction on the endpoint orbit. In special case of asymptotical subword 
growth of T\y (n) = n + const for all n > n$ we get an generalization of theorem 11.11 
i.e. description of all u.r. words with such growth property. The description of all (not 
nesesary u.r.) superwords such that T w {n) = n + const for all n > n see in [Bj. Note 
also that in all previous studies which is known for us interval exchange transformations 
are defined to be orientation preserving. 

The paper is organized as follows: in Section 2 we formulate the main definitions and 
facts about uniformly recurrent words, Rauzy graphs, and words generated by dynamical 
systems. In Section 3 we prove a theorem about necessary conditions for a word to be 
generated by interval exchange transformation. The next two sections are devoted to the 
proof of the sufficiency of these conditions. In Section 4 we prove that these conditions are 
sufficient for the word to be generated by a piecewise-continuous interval transformation. 
Finally, in Section 5 we prove that the sets of uniformly recurrent words generated by 
piecewise-continuous interval transformations and by the interval exchange transformation 
are equivalent. 

2 Main constructions and definitions 

2.1 Complexity function, special factors, and uniformly recur- 
rent words 

In this section we define the basic notions of combinatorics of words. By L we denote a 
finite alphabet, i.e., a nonempty set of elements (symbols). We use the notation A + for 
the set of all finite sequences of symbols or words. 

A finite word can always be uniquely represented in the form w — W\ ■ ■ -w n , where 
Wi G A, 1 < i < n. The number n is called the length of word w; it is denoted by \w\. 

The set A + of all finite words over A is a simple semigroup with concatenation as 
semigroup operation. 

If element A (the empty word) is included in the set of words, then this is actually the 
free monoid A* over A. By definition the length of the empty word is |A| = 0. 
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A word u is a subword (or factor) of a word w if there exist words p, q G A + such that 
w = pug. 

Denote the set of all factors (both finite and infinite) of a word W by F(W). Two 
infinite words W and V over alphabet A are said to be equivalent if F(W) = F(V). 

We say that symbol a 6 A is a left (accordingly, right) extension of factor v if av 
(accordingly, va) belongs to F(W). A subword v is called a left (accordingly, right) 
special factor if it possesses at least two left (right) extensions. A subword v is said to 
be bispecial if it is both a left and right special factor at the same time. The number 
of different left (right) extensions of a subword is called the left (right) valence of this 
subword. 

A word W is said to be recurrent if each its factor occurs in it infinitely many times 
(in the case of a doubly-infinite word, each factor occurs infinitely many times in both 
directions). A word W is said to be uniformly recurrent or (u.r word) if it is recurrent 
and, for each its factor v, there exists a positive integer N(v) such that, for any subword 
u of length at least N{y) of the word W, factor v occurs in u as a subword. 

Below we formulate several theorems about u.r words, which will be needed later. The 
proof of these theorems can be found in monograph pQ. 

Theorem 2.1 The following two properties of an infinite word W are equivalent: 

a) For any k there exists N(k) such that any segment of length k of the word W occurs 
in any segment of length N(k) of the word W ; 

b) If all finite factors of a word V are at the same time finite factors of a word W , 
then all finite factors of the word W are also finite factors of the word V . 

Theorem 2.2 Let W be an infinite word. Then there exists a uniformly recurrent word 
W all of whose factors are factors of W . □ 

One can consider the action of the shift operator r on the set of infinite words. The 
Hamming distance between words W\ and W2 is the quantity d(Wi, W2) = Sngz ^n.2 _ ' n ', 
where A n = if symbols at the n-th positions of the words are the same and A n = 1, 
otherwise. 

An invariant subset is a subset of the set of all infinite words which is invariant under 
the action of r. A minimal closed invariant set, or briefly, m.c.i.s, is a closed (with 
respect to the Hamming metric introduced above) invariant subset which is nonempty 
and contains no closed invariant subsets except for itself and the empty subset. 

Theorem 2.3 (Properties of closed invariant sets) The following properties of a word 
W are equivalent: 

1. W is a uniformly recurrent word; 

2. The closed orbit ofW is minimal and is a m.c.i.s. 

Theorem 2.4 Let W be a uniformly recurrent nonperiodic infinite word. Then 

1. All the words that are equivalent to W are u.r. words; the set of such words in 
uncountable; 

2. There exist distinct u.r. words W\ 7^ W2 which are equivalent to the given word and 
can be written as W\ = UV\, W2 = UV2, where U is a left-infinite word and V\ 7^ V2 
are right-infinite words. 
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2.2 Rauzy graphs 

It is convenient to describe a word W using the subword graphs or (Rauzy graphs). They 
were introduced by Rauzy [22J in the following way: the k-graph of the word W is the 
directed graph whose vertices biuniquely correspond to subwords of length k of the word 
W and there is an arc from vertex A to vertex B if W contains a subword of length k + 1 
such that the first k symbols of it make a subword that corresponds to A and the last 
k symbols make a subword that corresponds to B. Thus, the arcs of the fc-graph are in 
one-to-one correspondence with the (k + l)-factors of the word W. 

It is clear that, in the /c-graph G of the word W, vertices which are tails (accordingly, 
heads) of more than one arc correspond to right special words. Such vertices will be called 
crotches. Graph G is said to be strongly connected if it contains a directed path from any 
vertex to any other one. 

The follower of the directed graph G is the directed graph Fol(G) constructed in the 
following way: the vertices of the graph G biuniquely correspond to the arcs of the G and 
there is an arc from vertex A to vertex B if the head of the arc A in the graph G is at 
the notch end of B. 

The connectivity of the Rauzy graphs is naturally related to the recurrence of the 
corresponding word. Namely, the following assertion is valid. 

Proposition 2.5 Let W be a (semi)infinite word. The following conditions are equiva- 
lent: 

1. The word W is recurrent; 

2. For any k the corresponding k-graph of the word W is strongly connected; 

3. Any factor of W occurs at least twice; 
4- Any factor can be extended to the left. 

Let us introduce the notion of the labeled Rauzy graph. A Rauzy graph is said to be 
labeled if 

1. the arcs of any crotch are assigned symbols I ("left") and r ("right"); 

2. some vertices are assigned symbol 

The follower of the labeled Rauzy graph is the directed graph which is the follower 
of the latter (considered a Rauzy graph with the labeling neglected) and whose arcs are 
labeled according the following rule: 

1. Arcs that enter a crotch should be labeled by the same symbols as the arcs that 
enter any left successor of this vertex; 

2. Arcs that go out of a crotch should be labeled by the same symbols as the arcs that 
go out of any right successor of this vertex; 

3. If a vertex is labeled by symbol "-", then all its right successors should also be 
labeled by symbol "-" . 
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2.3 Words generated by dynamical systems 

Let M be a compact metric space, U C M be its open subset, / : M — > M ba a 
homeomorphism of the compact space into itself, and x G M be an initial point. 
With the sequence of iterations, one can associate an infinite binary word 

fa, /<»>(*„) e 17 

which is called the evolution of point xq. Symbolic dynamics investigates the interrelation 
between the properties of the dynamical system (M, /) and the combinatorial properties 
of the word W n . 

For words over alphabets which comprise more symbols, several characteristic sets 
should be considered: Ux, . . . , U n . 

Note that the evolution of the point is correctly defined only when the trajectory of 
the point does not pass through the boundary of the characteristic sets dUi, dUi-, ■ ■ ■■ 

In order to consider the trajectory of an arbitrary point, let us introduce the notion 
of essential evolution. 

Definition 2.6 A finite word v* is called essential finite evolution of point x* if any 
neighborhood of point x* contains an open set V such that any point x G V possesses the 
evolution v* . An infinite word W is called essential evolution of point x* if any its initial 
subword is an essential finite evolution of point x* . 

When there is no risk of ambiguity, we say evolution of the point meaning the essential 
evolution. Note that a point can have several essential evolutions. 

Proposition 2.7 ([2]) Let V be a finite word. Then the set of points with the finite 
essential evolution V is closed. A similar assertion is true for an infinite word W . 

2.4 Correspondence between words and partitions of a set 

Now let us consider the correspondence between words and subsets of M. It follows from 
the construction that, if the initial point belongs to the set Ui, then its evolution begins 
with symbol a*. Consider the images of the sets [/, under the mappings f^ 2 ^, ■ ■ ., 

n G N. 

It is clear that, if the point belongs to the set 

f { - n \u ln ) n f-^Ku^) n . . . n f^\u n ) n u io , 

then the evolution begins with the word a^a^ • • - a^ n . 

Accordingly, the number of different essential evolutions of length n + 1 is equal to the 
number of partitions of the set M into nonempty subsets by the boundaries of the sets 
dUi and their images under the mappings f^\ f^~ 2 \ • • • , f^~ n ^- 

Remark. The number of finite essential evolutions is directly related to the topologi- 
cal dimension of the set M. For instance, it is clear that, if is homeomorphic to a segment 
or a circle, then one point can belong to the boundary of at most two open subsets of M 
and, accordingly, can have only two essential evolutions. If M is homeomorphic to a part 
of the plane M 2 then there can be arbitrarily many essential evolutions. 
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2.5 Interval exchange transformations 



Interval exchange transformation is a natural generalization of the shift of a circle: in 
the case of the partition of a circle into arcs of length a and 1 — a and a shift of a, this 
transformation coincides with the two-interval exchange transformation. 

In addition, interval exchange transformation is rather important in ergodic theory, 
theory of dynamical systems, and number theory. 

Let us consider the general case: 

Suppose that the closed interval [0, 1] is partitioned into half-open intervals of lengths 
Ai, A 2 , • • • , A n and a G S n is a permutation of the set {1, 2, ... , n}. 

The intervals of the partition can be represented in terms of the lengths given above: 

Xi= y^Aj, y^Aj 

The interval exchange transformation rearranges the intervals (X\, X2, . . . , X n ) of the 
partition; result, we obtain a new partition 

(X a (l),X a (2),...,X a (n)). 
In the orientation-preserving case, transformation T associates each point x G X{ : 

T(x) — x + di, 

where 

&i = Acr(fe) — Afc. 

If the transformation inverts an interval, then, in addition, all points are symmetrically 
reflected with respect to the midpoint of this segment. 

Definition 2.8 Interval exchange transformation T is said to be regular if, for any point 
ai, where Xi = [a^, a i+ i), we have T n (aj) 7^ a^. 

The result formulated below is rather important (see [16J): 

Theorem 2.9 An interval exchange transformation is regular if and only if the trajectory 
of any point is dense everywhere in [0, 1]. 

Properties of words generated by interval exchange transformations are investigated 
using the same methods as in the case of the shift of the unit circle. Here, the main 
approach consists in considering the negative orbits of the ends of the exchanged intervals 



= di < 02 < ■ ■ ■ < flfc+l = 1, 

where 

Xi = [ai,a i+1 ), (i G {l,...,k}). 

Denote the set of ends of the exchanged intervals {ai|l <i<A; + l}by X 1 . A word 
w of length n is a subword of the evolution of point x, i.e., the infinite word U(x), if and 
only if there exists an interval I w C [0, 1] and point y G I w such that the word w is the 
concatenation of the symbols: 
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l(x)l{T(x)) ■ ■■Z(T n - 1 (x)) = w, 
where T(x) = <2j G A if and only if x G X{. 

Proposition 2.10 Let T be a regular k-interval exchange transformation. Then the evo- 
lution U(x) of any point x has the complexity function Tu^(n) = n{k — 1) + 1 for any 
n <EN. 

3 Necessary conditions for a word to be generated 
by an interval exchange transformation 

At the first stage we formulate necessary conditions for a word to be generated by an 
interval exchange transformation. Let the word W be the evolution of point x G [0, 1] for 
a /c-interval exchange transformation and characteristic sets U±, U2, ■ ■ ■ , U n . Each charac- 
teristic set Ui is a union of several disjoint open or half-open intervals. 

As was already shown in Section 12.41 subwords of length k are in one-to-one corre- 
spondence with the /c-partitions of the characteristic sets. Since a boundary point of a 
one-dimensional set can belong to the boundary of only two sets, a fc-subword of the word 
W can have at most two extensions. We obtain the first necessary condition for a word 
to be generated by interval exchange transformation: 

Proposition 3.1 Let the word W be generated by interval exchange transformation. 
Then, for a certain N, all special subwords of length at least N should have valence 
2. 

This condition is similar to the condition that, starting from a certain N, all fc-graphs 
of the word W (k > N) should have all incoming and outgoing crotches of degree 2. 

Now let us derive conditions in terms of Rauzy graphs. Suppose that the recurrent 
word W has the growth function F w {n) = Kn + L for n > N and is generated by 
interval exchange transformation. Consider the evolution of Rauzy fc-graphs of the word 
W starting with k > N + l. As has already been demonstrated, all incoming and outgoing 
crotches have degree 2; therefore, all fc-graphs have exactly K incoming and K outgoing 
crotches. 

In the minimal case described in the previous section, where Fw(ri) = n + L, Rauzy 
graphs have exactly one incoming and one outgoing crotch. If the incoming crotch coin- 
cides with the outgoing one, then the choice of the arc to be deleted from the follower 
D(G) is uniquely determined by the condition of strong connectivity of the graph. 

Let us thoroughly investigate a more interesting case, where graphs of words contain 
more than one crotch. There are several possible situations that should be considered 
when passing from graph G n to G n+ \\ 

1. Graph G n contains no linked cycles (i.e., there are no incoming crotches that are 
at the same time outgoing crotches). In this case, graph G n+ i coincides with the 
follower D(G n ). 

2. Graph G n contains one crotch which is at the same time an incoming and outgoing 
crotch. In this case, the follower graph D(G n ) has three crotches, because one crotch 
has been cloned. Therefore, the graph G n+ i is obtained from the follower D(G n ) by 
means of deleting one arc which corresponds to the minimal non-occurring word. 



11 



3. The graph contains two or more crotches which are at the same time incoming 
and outgoing crotches. Then the graph G n+ \ is obtained from D(G n ) by means of 
deleting two or more arcs which correspond to the minimal non-occurring words. 

Since the word W is recurrent, it follows from Proposition 12.51 that, as the arcs are 
deleted, the graph should remain strongly connected, i.e., it should contain a directed 
path from any vertex to any other one. 

Consider the second case in more detail. Suppose that Gk contains one double crotch. 
This means that W contains exactly one bispecial subword w of length k. Hence, there 
exist dj, cij, 0^, ai G A such that diW, ajw, wa^, and wai are factors of W. Then the (k + 1)- 
graph Gk+i is obtained from the follower by means of deleting an arc that corresponds to 
one of the four words: diWdk, diivai, ajwak, or ajwai. Consider the interval which is the 
characteristic set for the word, I w = [x w ,y w ]. 

Since w is a right special word, we have /„, C T~ 1 (I ak UJ a ,); since w is a left special 
word, we have I w C T(I a . U I a .). 

Suppose that point A e [0, 1] partitions I w into two intervals whose images lie in I ak 
and I a[ , respectively, and point B e [0, 1] partitions it into intervals whose preimages lie 
in I ai and I a ., respectively. 

The choice of the minimal non-occurring word (and, hence, the arc to be deleted) 
is determined by the mutual location of points A and B and by whether the mapping 
preserves orientation on these sets or they are reversed. There are 8 cases, which are 
divided into four pairs which correspond to similar sets of words: 
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Cases 1 and 5 correspond to the forbidden word aiwat- 
Cases 2 and 6 correspond to the forbidden word aiwai. 
Cases 3 and 7 correspond to the forbidden word ajwak- 
Cases 4 and 8 correspond to the forbidden word djwai. 

Two pairs of cases correspond to simultaneous reversal or preservation of orientation 
of the mapping on characteristic intervals; two other pairs correspond to opposite orien- 
tations. 

In the case where the interval exchange transformation preserves orientation, we have 
only two possibilities for the arc to be deleted. 

If the mapping is allowed to reverse the intervals in the process of the interval exchange 
transformation, all four cases are possible. 

Let us introduce the notion of the labeled Rauzy graph. A Rauzy graph is said to be 
labeled if 



12 



1. The arcs of any crotch are assigned symbols I ("left") and r ("right"); 

2. Some vertices are assigned symbol 

The follower of the labeled Rauzy graph is the directed graph which is the follower 
of the latter (considered a Rauzy graph with the labeling neglected) and whose arcs are 
labeled according the following rule: 

1. Arcs that enter a crotch should be labeled by the same symbols as the arcs that 
enter any left successor of this vertex; 

2. Arcs that go out of a crotch should be labeled by the same symbols as the arcs that 
go out of any right successor of this vertex; 

3. If a vertex is labeled by symbol then all its right successors should also be 
labeled by symbol 

Remark. Now let us explain the meaning of the labeled Rauzy graph. Let the arcs 
of the incoming crotch correspond to Oj and dj and symbols I and r correspond to the 
left and right set in the pair (T(I a .),T(I a .)). If symbols a k and a ; correspond to the 
arcs of the outgoing crotch, then symbols I and r appear in accordance with the "left- 
right" order in the pair (J afc , I ai ). A vertex is assigned symbol "-" if the characteristic set 
that corresponds to it belongs to an interval which is reversed in the process of interval 
exchange transformation. 

Below, we give a condition for passing from graph G n to G n+ i. 

Proposition 3.2 1. If the graph contains no double crotches that correspond to bis- 
pecial factors, then, when passing from G n to G n+ ±, we have G n+ i = D{G n ). 

2. If the vertex that corresponds to a bispecial word is not labeled by symbol then 
the arcs that correspond to the forbidden words are chosen among the pairs Ir and 
rl. 

3. If a vertex is labeled by symbol then the arcs to be deleted should be chosen 
among the pair 11 or rr. 

The evolution of labeled Rauzy graphs is said to be correct if Rules 1 and 2 arc 
complied with by all graphs in the evolution starting from G\\ the evolution is said to be 
asymptotically correct of Rules 1 and 2 are complied with starting from a certain G n . 
We say that the evolution of labeled Rauzy graphs is oriented if the /c-graphs contain no 
vertices labeled by symbol "-" . 

The definition of the asymptotically correct evolution of Rauzy graphs allows us to 
formulate the conditions for a word to be generated by interval exchange transformation. 

Proposition 3.3 A uniformly recurrent word W 

1. is generated by interval exchange transformation if the word is provided with the 
asymptotically correct evolution of labeled Rauzy graphs; 

2. is generated by orientation-preserving interval exchange transformation if the word 
is provided with the asymptotically correct oriented evolution of labeled Rauzy graphs. 

Our main result consists in replacing "if" with "if and only if" in the proposition 
formulated above. 



13 



3.1 Construction of the dynamical system 

Let us demonstrate that conditions of Theorem 13.31 are sufficient for the word to be 
generated by interval exchange transformation. First, we show that the word W that 
satisfies the hypotheses of Theorem 13.31 can be the evolution of a certain point under the 
following piecewise-continuous transformation of the segment T : I —> I: 

1. I = [xo,Xx] U [X 1 ,X 2 ] U . . . U [£ n _i,X n ], X = 0, X n = 1. 

2. I = [y , yi] U [y x , y 2 ] U . . . U [y n _i, y n ], y = 0,y n = 1. 

3. a G S n is a permutation of a set of n elements. 

4. Transformation T maps onto (2/<r(i),2/er(i)+i) in a continuous and bijective 
manner. 

Then we demonstrate that the case of piecewise-continuous transformation can be 
reduced to interval exchange transformation. 

WE construct the piecewise-continuous transformation T step by step, at the first 
iteration, we partition the interval into arbitrary subintervals which correspond to ap- 
propriate symbols. To construct the mapping, it is sufficient to determine the trajectory 
of these points and then, for reasons of recurrence, extend it by continuity to the entire 
interval. 

Notation. By virtue of continuity and bijectivity, the mappings on the intervals are 
monotonic functions. We shall consider two cases: 

1. All mappings of the intervals are increasing functions. Such transformation is said 
to be orientation-preserving. 

2. There are both increasing and decreasing mappings of the intervals. In this case, 
we say that transformation does not preserve orientation. 

Let us partition the segment / = [0, 1] into n = Card A arbitrary intervals, which will 
be regarded as characteristic sets for the symbols of alphabet A: [0, 1] = J 0l U/ a2 U. . .U/ 0n . 

The correspondence between the intervals and the symbols is defined by Proposi- 
tion [375] below. 

Let us assume that the mapping T is continuous on each set I ai , i.e., mapping T can 
be discontinuous only at the endpoints of the characteristic sets. The intervals of the 
characteristic sets (or their images) that have a common point are said to be adjacent. 

Remark. We can always enlarge the alphabet in such a way that characteristic sets 
in the extended alphabet be organized exactly as described above. The graphs of the 
A;-words in the original alphabet then correspond to 1-graphs of the extended alphabet 
and the evolutions of graphs coincide starting from this moment. 

One can directly verify the following assertion. 

Proposition 3.4 The images of two adjacent sets under transformations T and T^ 1 ) 
either are adjacent, or cannot cover the entire interval. 

Proposition 3.5 The partition intervals can be put in correspondence with the symbols of 
the alphabet in such a way that, if w is a special right 1-word and wa-i, waj are subwords, 
then sets I a . and I a . are adjacent. Similarly, if w is a special left 1-word and auw, aiw 
are subwords, then sets I a . and I ai are also adjacent. 
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Proof. Consider an arbitrary crotch in the 1-graph. Suppose that arcs that go out of this 
crotch lead to vertices that correspond to symbols a« and aj. Then we set I aj = [xq,Xi], 
I a . = [x 1 ,x 2 ], 

Since the characteristic sets are intervals, there is only one order relation that can 
be introduced on them (namely, I ai < I a . if X{ < Xj); the same order relation can be 
introduced on their images. If a pair of characteristic sets reverse their order under 
transformation T, then we say that, on this pair, the transformation changes orientation. 

Consider the images of the intervals under the mapping T _1 . It is clear that, if symbol 
Oj is not a right special 1-word and is inevitably followed by symbol aj, then I ai C T~ l (I aj ); 
if it is not a left special 1-word and is always preceded by symbol a k , then T~ l (I a .) C I ak . 

In the case where symbol ai is a right special 1-word and can be followed by symbols 
a.,- and ak, we have I ai C T~ l (I a . Ul afc ); if it is a left special one, then T _1 (J a J C I aj U/ afc . 

Denote the set of the images of the interval ends under the mapping T~ n by I n 
and denote the set of the ends of the intervals of characteristic sets by 1°, i.e., 1° = 

{xq, X\, . . . , X n }. 

As follows from considerations in Section 12.41 the set that corresponds to the word 
w = wiw 2 ■ ■ -w n is I w — I Wn n T _1 (/ t0n _ 1 ) n . . . n T^~ n+1 \l Wl ); accordingly, the set of 
boundary points of the sets that correspond to words of length n is 7° U I 1 U . . . U I^ n ~ l \ 

If a right special word is not a bispecial one (which means that it is not at the same 
time a left special one), then the location of the point that partitions this characteristic 
set does not matter and it can be chosen arbitrarily. 

In the case of orientation-preserving transformation, the partition should be in agree- 
ment with orientation-preserving rules. 

In the case where transformation does not preserve orientation, it is necessary that, 
in the process of evolution, the number of "breaks" inside the intervals being mapped be 
finite. 

Thus, we can define the images of points on a certain subset N G I. It follows from 
the construction that there exist intervals Ik = (xk,Xk+i) such that, inside these intervals, 
our transformation is monotonic. We can always extend it by continuity to a mapping 
of the interval into itself. The resulting piecewise-continuous transformation is just what 
we looked for. Let us denote it by T. Note that the initial point whose evolution is the 
desired word W = {w n } is the point of intersection for the sequence of nested intervals 
which correspond to prefixes Wo, waWi, wqW\W2, .... 

Thus, we have proved 

Theorem 3.6 For a recurrent word W to be generated by piecewise-continuous orien- 
tation-preserving transformation, it is necessary and sufficient that the word be provided 
with asymptotically correct oriented evolution of labeled Rauzy graphs. 

In the case of orient at ion- changing transformation, we have 

Theorem 3.7 For a recurrent word W to be generated by piecewise-continuous transfor- 
mation, it is necessary and sufficient that the word be provided with asymptotically correct 
evolution of labeled Rauzy graphs. 
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4 Equivalence of the set of uniformly recurrent words 
generated by piecewise— continuous transformation 
to the set of words generated by interval exchange 
transformation 

First, let us pass to the dynamics in which almost all points (in the sense of the Lebesgue 
measure) have distinct essential evolutions. Consider the essential evolutions of the points 
under transformation T. 

Consider a piecewise-continuous transformation of intervals. The theorem of existence 
of invariant measure (see [I]) yields that any mapping of a compact set has invariant 
probabilistic measure. Therefore, the mapping T has invariant measure /i and we can in- 
troduce a new semimetric d(x\, x 2 ) = MO^i, x 2)) on the segment. Note that the inequality 
fj,(xi,x 2 ) > does not imply that the points have different essential evolutions, since the 
mapping constructed can be discontinuous. Moreover we need chose a suitable measure. 

The partition U\, . . . , U n of interval is called a pure partition if following conditions 
hold: 1) for any aj characteristic set Ui is convex, i.e. Ui is closed, semienclosed or open 
interval. 2) if two points Xi,x 2 has a same color and the interval (xi, x 2 ) contain a break 
point then images T(x\) and T(x 2 ) has a different colors. 

Let U\, U 2 , . . . , U n be a partition into characteristic sets. The partition Vi, V 2 , . . . , V m 
is called subpartition if each characteristic set Ui is union of sets Vf U = U Vi 2 U V* f . 

Let W be an evolution for some partition and W = {w'j} be an evolution for its 
subpartition. It is clear that W raise from W by gluing of symbols. It is easy to see that 
every partition has a pure subpartition. 

Let W would be an uniformly recurrent word, corresponding some evolution with 
partition U, W is a word, corresponding the evolution with the same initial point and 
subpartition W . Gluing morphism of alphabets give us a natural morphism n of the words 
it : W —> W. Logically, W may not necessary uniformly recurrent, but there is an u.r. 
U such that U ^ W . Then U = tt{U) ^ W = n{W) and hence u.r via theorem 12.41 
In the corresponding to U closed orbit would be point corresponding an evolution with 
projection W. Indeed, let w be an arbitrary subword of W, it occurs in any subword of W 
of length > k(w). Hence in any subword of U of length > k(w) there exist a subword w 
such that tt(w) = w. Hence for any surrounding of zero position of W there exist V = V 
such that 7r(V) has the same surrounding. It remains to use compactness argument. 
Corresponding word W will be u.r. In the sequel we consider only pure partitions. 

Proposition 4.1 Suppose that points X\ < x 2 have the same evolution and and the par- 
tition is pure. Then any point of the interval (xi,x 2 ) has the same essential evolution. 

Proof. The fact thatxi and x 2 have the same evolution yields that T(xi) and T(x 2 ) 
have the same evolution as well and the image of the interval (xi,x 2 ) is the interval 
(T(xx),T(x 2 )). ' " □ 

Let W be an nonperiodic uniformly recurrent word generated by a piecewise-conti- 
nuous transformation of the interval. Consider the space of words with shift operator. 
For W there exist minimal invariant set N\y (see PQ). Every point V G Nw correspond 
set of points of system of intervals with corresponding essential evolution. On the space 
Nyy there exist invariant (for shift operator) probabilistic measure v ([!]). This measure 
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induce on interval system invariant probabilistic measure \i by the natural way. The more 
detailed description of this measure \i is follows. If / is an interval, corresponding to the 
word u, and N is the set of points of closed orbit of W having subword u on given position, 
then //(/) = v{N). If / has no intersection with any such intervals, then //(/) = 0. 

The measure \i is invariant and induces a semimetric. (Length of any segment is equal 
its measure.) 

Consider topology that glue point with zero distance and construct corresponding 
factor-dynamics. Obtaining map is piecewise-isometric, i.e it is interval exchange trans- 
formations. Every such glued interval is contained in maximal glued interval that the 
number of such maximal intervals is countable and their common measure is zero. Hence 
for almost every point (in the sense of \x) of our compact M has orbit without intersection 
of any such glued interval. We have constructed piecewise continuous transformation of 
system M' generating an u.r. word, W which is equivalent to W, and hence, via com- 
pactness argument, M' has a point whose essential evolution is W, because if there is a 
word with essential evolution W, then any equivalent word can be observed. 

We have proved 

Theorem 4.2 Suppose that the word W is generated by a piecewise- continuous transfor- 
mation of the interval. Then there exists a word W which is equivalent to the given one 
and is generated by interval exchange transformation. 

By theorem, this interval exchange transformation involves all words that are equiva- 
lent to W including the word W (in the sense of the essential evolution). Thus, we have 
proved the following assertion. 

Theorem 4.3 For a recurrent word W to be generated by a piecewise-continuous orien- 
tation-preserving transformation, it is necessary and sufficient that the word be provided 
with asymptotically correct oriented evolution of labeled Rauzy graphs. 

In the case of orientation-changing transformation, we have proved the following. 

Theorem 4.4 For a recurrent word W to be generated by a piecewise-continuous orien- 
tation-changing transformation, it is necessary and sufficient that the word be provided 
with asymptotically correct evolution of labeled Rauzy graphs. 
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